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POLYNUCLEOTIDE SYNTHESIS AND LABELING 
BY KINETIC SAMPLING LIGATION 



CROSS-REFERENCE TO RELATED APPLICATIONS 

[0001] This application claims benefit from United 

States provisional patent applications No. 60/415,043, 
filed Sept. 30, 2002, and No. 60/446,184, filed Feb. 7, 
2003, the disclosures of which are incorporated herein by 
reference in their entireties. 

FIELD OF THE INVENTION 

[0002] The present invention is in the field of 

molecular biology, and relates to the construction of 
polynucleotides suitable for use as genetic probes and 
other purposes. More specifically, the invention relates 
to methods for the construction of polynucleotides by 
ligation of shorter oligonucleotides without the need to 
first purify the oligonucleotides and without the need 
for a polynucleotide template. The invention also 
relates to methods of ligating oligonucleotides to one or 
both ends of a single-stranded polynucleotide. 
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BACKGROUND OF THE INVENTION 

[0003] A variety of modern molecular biology 
techniques call for relatively long single-stranded 
probes . 

[0004] For example, in the genotyping methods 

described in WO 02/057491 and Hardenbol et al . , Nature 
Biotechnol. 21(6): 673-678 (2003), each probe includes at 
least two targeting domains complementary to the nucleic 
acid target and at least one universal priming site, and 
may additionally include further universal priming sites, 
one or more genotypic labels ("bar code" sequences), and 
other sequences: the probes can thus readily exceed 80 nt 
in length. Various types of multiplex polymerase chain 
reactions ("PCR"), RNA profiling methods and genetic 
expression scoring methods similarly call for long 
single-stranded ("ss w ) probes. 

[0005] In a variety of these methods, a plurality of 

long single-stranded probes are combined and used as a 
mixture, or library, in multiplex analyses. 

[0006] Current methods for constructing long single- 

stranded polynucleotides are cumbersome and inefficient, 
particularly so for preparing libraries comprising a 
plurality of such probes. 

[0007] Direct chemical synthesis of polynucleotides of 

the required length provides low overall yields and a 
high proportion of contaminating truncated 

oligonucleotides that lack one or more nucleotides at one 
or both of their termini. For example, even at 99% 
efficiency per coupling cycle, a synthesis of a 120-base 
oligonucleotide would be expected to yield only 3% full- 
length product, along with a 30 -fold molar excess of 
truncates. The low yield necessitates a large scale 
synthesis in order to obtain enough of the final product, 



which is expensive, and the high proportion of truncates 
necessitates purification of the full-length product. In 
addition to their low yield, such chemical synthesis 
methods require the separate synthesis of each of the 
probes, which must thereafter be combined for multiplex 
analysis . 

[0008] An alternative to direct chemical synthesis, 
PCR synthesis, is error-prone, even with proofreading 
polymerases having 3' to 5' exonuclease activity. 
Furthermore, amplification-based synthesis methods 
require a template polynucleotide that is complementary 
to the entire length of desired probe sequence. When 
probe-length template is unavailable from natural 
sources, the template typically must be synthesized 
chemically, recapitulating the problems and disadvantages 
that inhere in direct chemical synthesis of long 
oligonucleotides . 

[0009] Another alternative to direct chemical 
synthesis, Template-Directed Ligation (TDL) , involves 
ligation of a number of oligonucleotides that are aligned 
for ligation by hybridization to a polynucleotide 
template complementary to the entire length of the 
desired polynucleotide product. U.S. Pat. No. 5,998,175 
describes a TDL-based method using pentameric 
oligonucleotides, which are short enough to be ligated 
without first being purified from truncated contaminants, 
since truncations comprise only a small fraction of 
synthesis products for such short oligonucleotides. As 
with PCR, however, the requirement for a separate 
template, with a different sequence for each member of a 
series of polynucleotide probes, makes TDL impractical 
for synthesis of long single-stranded probes, 



particularly so for synthesis of an entire library of 
such probes . 

[0010] Methods that require full-length templates also 

present the problem of isolating the probe from a 
template of identical or near-equivalent length. 
[0011] Another ligat ion-based technique obviates the 
need for full-length template. In this method, two 
shorter substrate oligonucleotides are held in proper 
register for ligation by stable hybridization to a third 
oligonucleotide that is complementary to each of the 
substrate oligonucleotides at the ends to be ligated; the 
third oligonucleotide is of subtemplate length. 
[0012] Although such methods obviate the need for a 
full-length template, they require that the substrate 
oligonucleotides be purified from truncates prior to 
ligation: truncated contaminants, particularly those 
lacking only one or two terminal nucleotides or a 
terminal phosphate group at the end to be ligated, can 
stably form unproductive complexes with the third 
oligonucleotide, severely inhibiting ligation of the 
full-length oligonucleotides to form the desired full- 
length polynucleotide product. 

[0013] The need thus exists for an efficient, cost- 

effective method of constructing long single - stranded 
polynucleotides. The method should not require the prior 
construction of a separate full-length polynucleotide 
template for each desired probe. The method should 
permit use of oligonucleotides without their prior 
purification from truncated contaminant molecules. When 
possible, the method should exploit the fact that, in 
some analytical methods, the majority of the sequence in 
each probe molecule is the same, regardless of the 
genetic variant to be detected or locus to be analyzed, 



with sequence variation limited to specific well-defined 
portions within the probes. The method should be 
amenable to use in multiplex format where the entire 
range of sequences in the library can be created at the 
same time, in the same synthesis reaction, rather than in 
separate reactions. The method should exhibit high 
ligation efficiency, resulting in a high yield of the 
desired polynucleotide probes. 

SUMMARY OF THE INVENTION 

[0014] The present invention satisfies these and other 

needs in the art by providing a method for constructing a 
polynucleotide suitable for use as a genetic probe by 
ligating three shorter oligonucleotides. A scaffold 
oligonucleotide is provided that is complementary or 
partly complementary to all three oligonucleotides to be 
ligated. There is no requirement for a full-length 
template strand, i.e. the scaffold oligonucleotide is 
subtemplate in length. The scaffold oligonucleotide 
serves to properly align the three oligonucleotides for 
ligation, but does not extend the entire length of the 
desired ligation product. The ligation conditions are 
chosen so that annealing of outer oligonucleotides to the 
scaffold is unstable. Such conditions render the 
ligation reaction resistant to inhibition by nonligatable 
contaminants in oligonucleotide preparations, meaning 
that unpurified oligonucleotides can be used. 
[0015] In one embodiment, a first oligo, a central 

oligo and a second oligo to be ligated are annealed to a 
scaffold oligonucleotide that is complementary to the 
entire length of the central oligo but only complementary 
at its (the scaffold oligonucleotide's) ends to a short 
region at one end of each of the first and second oligos. 



The scaffold and central oligos are purified away from 
truncated and other ligat ion-defective forms present in 
the mixture of products from their synthesis reactions. 
The first and second oligos are used without 
purification, as the entire population of their 
respective oligonucleotide synthesis reactions, including 
truncates and other ligation-def ective contaminants. 
[0016] The ligation conditions, and the lengths of the 

regions of complementarity between the oligonucleotides, 
are chosen such that during ligation the scaffold is 
unstably annealed to the first and second oligos. Such 
unstable annealing represents a way of sampling the 
populations of truncate-containing synthesis products for 
the first and second oligonucleotides, in that duplexes 
form and dissociate rapidly under such conditions. Any 
duplexes formed with contaminating ligat ion-defective 
oligos will not be good substrates for ligation and will 
dissociate rapidly. Eventually the desired full-length 
synthesis products in the first and second 
oligonucleotide synthesis reaction mixtures will form 
duplexes with the scaffold and be ligated to the central 
oligonucleotide. Such ligation is a sampling process in 
that the ligase enzyme is able to selectively ligate the 
desired full-length synthesis products despite the 
presence of an excess of contaminants. A ligation 
reaction performed under such conditions may be referred 
to as a "sampling ligation." 

[0017] A ligase enzyme and appropriate cof actors, 
salts and buffers are added, and the reactions are 
incubated at an appropriate temperature until ligation is 
substantially complete. Optionally, the full-length 
ligation product may then be separated from the scaffold 
oligonucleotide and any other unligated oligonucleotides 



by denaturation and size-based separation, such as 
denaturing polyacrylamide gel electrophoresis (PAGE) or 
column chromatography. The desired full-length ligation 
product, constructed by ligation of the first, central 
and second oligonucleotides, is referred to as the 
product polynucleotide . 

[0018] The method may be performed by sequential 

addition of oligonucleotides or it may be performed in a 
single step. The regions of complementarity between the 
scaffold and the first or second oligos may vary, e.gr. 
from 5 to 10 nt long, and ligation may be performed at 
various temperatures, e.g. from 20°C to 60°C. The only 
requirement for unstable annealing between the scaffold 
and the first and second oligos is that the ligation 
temperature be high enough to prevent stable duplex 
formation. The method can be used with the products of a 
single oligonucleotide synthesis reaction for each of the 
first and second oligos, or may be used with a plurality 
of oligonucleotide syntheses, the full-length products of 
which differ in sequence from each other. The products 
of five to 100 separate oligonucleotide syntheses may be 
pooled to form the populations of first or second 
oligonucleotides to be sampled. 

[0019] The method of the present invention may also be 

used to extend a target single- stranded polynucleotide by 
ligating an oligonucleotide to the target 

polynucleotide's 3' end, its 5' end, or both. Separate 
scaffold oligonucleotides are used for the 3' and 5' end, 
making it possible to append different extending 
oligonucleotides at the opposing ends of the target 
polynucleotide. Ligation may be performed under 
conditions in which the scaffold oligonucleotide is 
unstably annealed to the target polynucleotide. 



[002 0] The oligonucleotides appended to the target 

polynucleotide may contain sequences useful in subsequent 
experimental manipulations, such as PCR primer binding 
sites, bar code sequences, or type IIS restriction 
endonuclease binding sites. 

[0021] The method may be also used to label either the 

5' or 3 ' end of an oligonucleotide, or a plurality of 
oligonucleotides, for example by adding a radioactive or 
fluorescent label . 

[0022] The method of the present invention may also be 

used to ligate two or more species of oligonucleotides in 
the presence of a up to a ten-fold molar excess of 
truncate forms of one or more of the species with less 
than a three-fold decrease in ligation efficiency when 
compared with the same ligation reaction performed 
without truncate forms. 

BRIEF DESCRIPTION OF THE FIGURES 

[0023] These and other objects and advantages of the 

present invention will be apparent upon consideration of 
the following detailed description, taken in conjunction 
with the accompanying drawings, in which like graphical 
representations refer to like structures throughout, and 
in which: 

[0024] FIG. 1 is a schematic representation of several 

previous methods used to construct polynucleotides. 
FIG. 1A shows the method of U.S. Pat. No. 5,998,175; 
FIG. IB shows the method of U.S. Pat. No. 6,110,668; 
FIG. 1C shows the method of U.S. Pat. No. 5,380,831; and 
FIG. ID shows the method of Chalmers et al . , 2001, 
BioTechniques 30: 249. In this and all schematic 
representations herein, pairs of vertical dashed lines 
aligned co-linearly are intended to indicate base pairing 



generally, and the number of dashed lines is not intended 
to reflect any specific number of basepairs. The 
schematics also are not intended to accurately reflect 
the relative lengths of the oligonucleotides or 
polynucleotides depicted. In all schematics, unless 
otherwise indicated, the left end of the upper strands 
represents the 5' end, and the right end represents the 
3' end. The directionality of the bottom strands is 
reversed, with left ends representing 3' ends and right 
ends representing 5' ends. 

[0025] FIG. 2A is a schematic representation of a 

ligation method according to the present invention. 
[0026] FIG. 2B is a schematic representation of the 

principle of kinetic sampling ligation. Despite this 
schematic, the present invention is not limited by 
theory. 

[0027] FIG. 3 is a schematic of a pair of ligation 

reactions designed to test the effect of contaminating 
nonligatable oligonucleotides on full-length product 
formation in kinetic sampling ligation reactions. The 
number of oligonucleotide molecules depicted is not 
intended to accurately reflect their molar ratio in the 
reaction . 

[0028] FIG. 4A is a schematic of a method of extending 

both the 3' and 5' ends of a single-stranded 
polynucleotide according to the present invention. 
[0029] FIG. 4B is a schematic of the preparation of 

single stranded fragments from genomic DNA for random 
cloning. The fragments produced are suitable for 
extension according to the method illustrated in FIG. 4A. 
[0030] FIGS. 5A and 5B are schematics of methods of 

labeling the 3' end of an oligonucleotide according to 
the present invention. The label is depicted as attached 



to the 3' end of the labeling oligonucleotide, but may be 
attached at other locations on the labeling 
oligonucleotide provided that it does not interfere with 
the ligation reaction. 

[0031] FIG. 6 shows a polyacrylamide gel of the 

products of ligation reactions performed according to the 
present invention . 

[0032] FIG. 7 shows densi tometric scans of the lanes 

of an electrophoresis gel demonstrating the formation of 
adenylated oligonucleotides during ligation reactions. 

[0033] FIG. 8 shows a polyacrylamide gel of the 

products of a labeling reaction performed according to 
the present invention. 

[0034] FIG. 9 is a plot showing the results of a 

series of multiplex labeling reactions performed 
according to the present invention. 

[0035] FIG. 10 is a plot comparing the results of 

multiplex labeling reactions to singleplex labeling of 
the same oligonucleotides, each performed according to 
the present invention. 

DETAILED DESCRIPTION 

[0036] Several prior methods of constructing 

polynucleotides are schematically illustrated in FIG. 1. 
These prior methods are inefficient for producing entire 
libraries of different species of polynucleotides 
suitable for use as genetic probes. FIG. 1A illustrates 
an embodiment of the method of U.S. Pat. No. 5,998,175, 
and FIG. IB illustrates an embodiment of the method of 
U.S. Pat. No. 6,110,668, the disclosures of which are 
incorporated herein by reference in their entireties. 
These two methods require a full-length template strand 
upon which the complementary strand, i.e. the 



polynucleotide product, is constructed. FIG. 1C 
illustrates an embodiment of the method of U.S. Pat. No. 
5,380,831, and FIG. ID illustrates an embodiment of the 
method of Chalmers et al . , 2 001, BioTechniques 30: 24 9, 
the disclosures of which are incorporated herein by 
reference in their entireties. These latter two methods 
entail synthesis of a full-length template along with the 
desired polynucleotide product. All four methods require 
that the template strand be separated from the 
polynucleotide product and discarded before the 
polynucleotide product can be used as a single -stranded 
genetic probe . 

[0037] The methods illustrated in FIGS. 1A - ID also 

require that the oligonucleotides used to construct the 
polynucleotide product be purified prior to use, or that 
only very short oligonucleotides be used. Such 
purification is time-consuming and expensive. Methods 
requiring full-length templates and purification of 
oligonucleotides prior to ligation are ill-suited to 
construction of a library of a number of different 
species of polynucleotide for use as genetic probes, each 
of which is needed only in relatively small quantities. 
[0038] The present invention, in one aspect, provides 

a method for constructing a library of polynucleotides 
suitable for use as probes in genetic screening. The 
method involves kinetic sampling ligation of three 
shorter oligonucleotides, two of which may be unpurified, 
into a product polynucleotide suitable for use as a 
gene t ic probe . 

[0039] FIGS. 2A and 2B illustrate two aspects of the 

method. Referring to FIG. 2A, first and second oligos 
are ligated to the 5' and 3' termini, respectively, of a 
central oligonucleotide. Ligation of the three 
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oligonucleotides is performed in the presence of an 
oligonucleotide scaffold complementary to the entire 
length of the central oligonucleotide and extending 
beyond both the 5' and the 3' ends of the central 
oligonucleotide . 

[0040] The duplex formed by annealing the central and 
scaffold oligonucleotides is referred to as the core 
duplex. Core duplex may be formed by partially duplexing 
the central oligonucleotide with the scaffold 
oligonucleotide prior to contacting (e.gr. mixing) the 
central and scaffold oligonucleotides with the first and 
second oligonucleotides. (The terms contacting, mixing 
and combining are used synonymously herein.) 
Alternatively, core duplex may be formed in the presence 
of the first and second oligonucleotides. 

[0041] The bases of the oligonucleotide scaffold 

extending beyond the 5' end of the central 

oligonucleotide are complementary to the first oligo, and 
those extending beyond the 3' end of the central 
oligonucleotide are complementary to the second oligo. 
In the embodiment illustrated, when all three of the 
oligonucleotides to be ligated are base-paired to the 
scaffold oligonucleotide, no bases on the scaffold remain 
unpaired, and the ends of the first and second oligos are 
properly positioned for efficient ligation to the 5' and 
3' ends, respectively, of the central oligonucleotide. 
[0042] Oligonucleotides used in the ligation will 

typically be obtained by conventional solid-phase 
automated chemical synthesis, such as phosphoramidite 
synthesis, although oligonucleotides obtained in other 
ways may also be used. The method permits use of 
unpurified first and second oligonucleotides, i.e., crude 
populations of oligonucleotide synthesis reaction 
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products where there has been no attempt to separate and 
remove truncated or otherwise defective oligonucleotides 
from the desired full-length oligonucleotide synthesis 
product after the oligonucleotides have been cleaved from 
the solid support. Such mixtures of chemical synthesis 
reaction products are herein referred to as unpurified 
regardless of any steps, such as deprotection, washing 
and rinsing, that are performed prior to the cleavage 
from the solid support. Reaction products are also 
referred to as unpurified despite steps subsequent to 
cleavage from the solid support, such as desalting or 
washing, so long as such steps are not designed to remove 
truncated synthesis products. 

[0043] As used herein, the terms "oligonucleotide" and 
"polynucleotide" are not intended to reflect any 
particular number of nucleotides, and are used 
synonymously; both "oligonucleotide" and "polynucleotide" 
may refer to molecules of any length. The term "oligo," 
as used herein, is synonymous with "oligonucleotide." 
[0044] In most applications, the scaffold 

oligonucleotide will be chemically synthesized, and will 
be no longer than necessary to provide the desired 
complementarity with other oligonucleotides, as 
illustrated in FIG. 2A. In other embodiments, however, 
single-stranded oligonucleotides obtained from natural 
sources may be used as the scaffold oligonucleotide. 
Such scaffold oligonucleotides may comprise additional 
nucleotides at their 3' end, 5' end, or both. These 
additional nucleotides extend beyond the region of 
complementarity of the scaffold oligonucleotide to the 
first and second oligonucleotides, are not complementary 
to the sequence of the desired product of the ligation 
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reaction, and play no role in the synthesis of the 
polynucleotide product. 

[0045] A mixture of core duplex, first and second 
oligos, ligase, salts, buffers and cof actors is then 
incubated under conditions wherein the first oligo and 
the second oligo do not form stable duplexes with the 
scaffold oligonucleotide. In one embodiment, the central 
oligonucleotide and the scaffold oligonucleotide form a 
stable core duplex under these conditions. Whether the 
core duplex is stable or unstable under ligation 
conditions, however, is not an essential aspect of the 
present invent ion . 

[0046] Ligation under conditions that do not permit 

stable duplex formation between the core duplex and the 
first and second oligos is referred to herein as kinetic 
sampling ligation or simply sampling ligation, and is 
illustrated in FIG. 2B. For simplicity, sampling 
ligation is illustrated only for ligation of the second 
oligo, with the dotted box representing the relevant 
portion of the core duplex. An analogous process takes 
place with the first oligo at the other end of the core 
duplex (not shown) . 

[0047] As shown, an unpurified mixture of synthesis 
products for the second oligo, including truncates, is 
added to the core duplex, leading to complexes between 
core duplex and either the truncate or the full-length 
second oligo. The truncate complexes contain a gap, 
preventing them from being ligated, whereas the full- 
length complexes do not. Although only one truncate 
species, missing a single base, is illustrated in 
FIG. 2B, an unpurified mixture of oligonucleotide 
synthesis products may also contain truncate species 
missing two or more bases. Only full-length second 



oligonucleotides can be ligated to form the desired 
product polynucleotide. The unligated core duplex 
remaining in complexes with truncate oligonucleotides 
then re-equilibrates with the remaining truncate and 
full-length second oligos, again generating a mixture of 
truncate and full-length oligo complexes, as shown in the 
first re-equilibration step in FIG. 2B. Again, only 
complexes containing full-length second oligo can be 
ligated to form product polynucleotide. 

[0048] After a number of cycles of such ligation and 

re-equilibration, all or a substantial percentage of the 
core duplex is ligated to full-length second oligo to 
form the desired polynucleotide product, provided that an 
excess of full-length second oligo over core duplex is 
present. Although only three such re-equilibration steps 
are shown in FIG. 2B, the actual number of such steps is 
dependent on a number of experimental variables, 
including but not limited to the proportion of truncate 
contaminants in the oligonucleotide synthesis product 
mixture . 

[0049] The ligation reaction is allowed to proceed 

until the desired yield of full-length polynucleotide 
product is obtained, but typically no longer than is 
conveniently performed, typically hours or less. In 
terms of FIG. 2B, the reaction is allowed to proceed 
until enough cycles of ligation and re -equilibration take 
place to obtain the desired yield of polynucleotide 
product. The reaction conditions are selected to ensure 
that re-equilibration is sufficiently rapid that ligation 
reactions can be completed within a convenient period. 
The degree of completion of the reaction can be readily 
assessed by qualitative determination of the amount of 



full-length product polynucleotide produced, e.g., by gel 
electrophoresis . 

[0050] Ligation conditions include the temperature, 

salt concentration, ionic strength, presence or absence 
of molecular crowding agents, pH and all other factors 
that influence the stability of nucleic acid base 
pairing. The length of duplex formed between the first 
or second oligo and the scaffold is another variable 
influencing the stability of nucleic acid duplexes, and 
thus the rate of re-equilibration of the core duplex with 
truncate and full-length first and second oligos. 
Duplexes that re-equilibrate rapidly relative to the 
timeframe of the ligation reaction are referred to as 
unstable duplexes . 

[0051] The ligation of unstably duplexed oligos has 

significant advantages when ligating a mixture of 
oligonucleotides containing truncated forms of the full- 
length oligonucleotide. Such truncated forms are 
invariably created in the chemical synthesis of 
oligonucleotides, with truncates comprising a greater 
percentage of the total reaction products as the length 
of the oligonucleotide being synthesized increases. In 
prior methods, in which the presence of truncates 
inhibits the desired reaction, oligonucleotides over 
approximately twenty nucleotides must typically be 
purified away from contaminating truncates before use. 
Purification of such oligonucleotides is typically 
accomplished by preparative scale denaturing 
polyacrylamide gel electrophoresis, which is time and 
labor intensive, and thus expensive. 
[0052] In the methods of the present invention, 

unstable duplexes formed by unligatable contaminants in 
the populations of first and second oligonucleotide 



synthesis products will dissociate relatively rapidly 
with respect to the time of the ligation reaction (i.e., 
re _ e q U ilibration will take place) , and thus will not 
permanently block duplex formation by the full-length 
first or second oligo. Duplexes formed by the full- 
length first or second oligos will also be short-lived, 
but since ligation is irreversible, even relatively 
infrequent ligation of such transient "unstable" duplexes 
will eventually lead to the ligation of most, if not all, 
of the nucleic acid strands capable of being ligated, as 
illustrated in FIG. 2B. 

[0053] The sampling ligation reaction according to the 

present invention is dynamic: many short-lived duplexes 
between the core duplex and the first and second oligos 
form and dissociate during the course of the ligation 
reaction. It is this dynamic re-equilibration that 
prevents contaminating truncated oligos from forming 
long-lived duplexes, with half -lives comparable to the 
length of the ligation reaction, that would prevent the 
full-length oligo from binding. In this way, kinetic 
sampling ligation prevents truncated contaminants from 
inhibiting formation of the desired full-length ligation 
product. Example 1 herein below demonstrates the 
construction of a polynucleotide suitable for use as a 
genetic probe using one embodiment of a sampling ligation 
method of the present invention. An experimental method 
to determine whether duplex formation is "stable" or 
"unstable" is discussed below, illustrated in FIG. 3, and 
demonstrated in Example 3 . 

[0054] Unlike traditional ligation under stable duplex 

conditions, the presence of truncates does not prevent 
formation of the desired ligation product in the present 
invention. For this reason, it is possible with the 
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current invention to use populations of truncate- 
containing oligonucleotide synthesis products as the 
first and second oligos. Example 3 illustrates this 
feature of the invention. 

[0055] In one embodiment, ligation reaction conditions 

used in the methods of the present invention allow stable 
core duplex formation, and the temperature is low enough 
to be permissive for ligase activity. In another 
embodiment, the number of complementary bases between the 
central and scaffold oligonucleotides in the core duplex 
is greater than the number of complementary bases between 
the core duplex and either of the first or second oligos, 
such that a ligation temperature exists where the core 
duplex is stable but the complexes with the first and 
second oligos are unstable. 

[0056] In one embodiment of the invention, the central 

oligonucleotide is annealed to the oligonucleotide 
scaffold to form the core duplex in a first step prior to 
the addition of the first and second oligonucleotides. 
The core duplex so formed may be stored for subsequent 
use in several different synthesis reactions. 
[0057] Annealing, like hybridization, refers to the 

process of forming base pairs between complementary 
strands of nucleic acids. The annealed strands may be 
referred to as a duplex, and the process of creating a 
duplex may be referred to as duplexing. When fewer than 
all the bases of one or both strands are base paired the 
result is a partial duplex, and the process of creating a 
partial duplex may be referred to as partial duplexing. 
Annealing, base pairing, hybridizing and duplexing are 
used synonymously herein, as are anneal, basepair, 
hybridize and duplex. Annealing, base pairing, 
hybridizing and duplexing may be either stable or 



unstable. Unless otherwise indicated, complementary 
nucleic acid strands are assumed to form ant iparallel , 
rather than parallel, duplexes. 

[00 58] Formation of a common core duplex is 

particularly advantageous when a library of genetic 
probes can be designed to incorporate all sequence 
differences into the first and second oligos, so that the 
central and scaffold oligonucleotides are identical for 
all members of the probe library. All members of the 
library of probes can then be constructed using the same 
core duplex by simply changing the first and second 
oligos that are used. In another embodiment, a single 
species of central oligonucleotide may be annealed to a 
mixture of different scaffold oligonucleotides to form a 
family of core duplexes differing only in the single- 
stranded "sticky end" regions of the scaffold. 
[0059] The first oligo, the second oligo, or both, are 

then added to the core duplex. In one embodiment, both 
first and second oligos are present in at least a 5- fold, 
6-fold, 7-fold, 8-fold, 9-fold, even at least 10-fold 
molar excess over the central oligonucleotide in the 
ligation . 

[0060] Other embodiments of the invention involve the 

simultaneous addition of the first oligo, the second 
oligo, or both, along with the central and scaffold 
oligonucleotides. The order of addition of the first, 
second, central and scaffold oligonucleotide, however, is 
not an essential aspect of the present invention, and any 
order of addition falls within the scope of the 
invention . 

[0061] A ligase is then added, along with any salts, 

buffers and cof actors required by the ligase. Other 
embodiments involve addition of ligase, salts, buffers 



and cofactors at different points in the reaction, and 
the timing of the addition is not an essential aspect of 
the invention. The ligation may be effected by use of a 
ligase enzyme such as T4 DNA ligase, T7 DNA ligase, Taq 
DNA ligase, E. coli DNA ligase, or thermostable Pfu DNA 
ligase, or any enzyme from any other organism, or any 
synthetic enzyme, that is capable of joining 
oligonucleotides together. RNA ligases may also be used 
if the oligonucleotides to be ligated are substrates for 
RNA ligase. The temperature of the ligation can be at 
least 20°C, 25°C, 30°C, 35°C, 42°C, 50°C or 60°C. 
[0062] As mentioned above, relatively large quantities 

of core duplex comprising the central and scaffold 
oligonucleotides can be prepared in bulk and stored for 
use in several different reactions, and may be stored and 
aliquots thereof used many times. The central and 
scaffold oligonucleotides can be purified away from 
truncates prior to forming the core duplex. Since the 
central and scaffold oligonucleotides, in one embodiment 
of the present invention, are common to all members of a 
library of genetic probes, these two oligonucleotides can 
be purified with minimal expenditure of time and effort. 
For example, a library of 50 genetic probes would require 
50 different species of first oligo and 50 different 
species of second oligo, but only one species of central 
and one species of scaffold oligonucleotide. Of the 102 
different oligonucleotides required, only two would need 
to be purified. This represents a 98% savings in 
purification effort compared with prior methods that 
require purification of every oligonucleotide to prevent 
complications due to the presence of truncates. 
[0063] After ligation of the first, central and second 
oligonucleotides to form the full-length product 



polynucleotide, the scaffold oligonucleotide and other 
unligated oligonucleotides can be separated from the 
product polynucleotide, or not, depending on the intended 
subsequent use of the product polynucleotide. In one 
embodiment, size-based separation, such as denaturing 
PAGE, can be used to remove the scaffold and other 
oligonucleotides. The scaffold oligonucleotide is 
significantly shorter than the full-length product 
polynucleotide and is thus easily separated. Separation 
can also be effected by liquid chromatography under 
denaturing conditions. The central oligonucleotide can 
be f luorescently labeled to allow easy detection of the 
full-length product during purification. 

[0064] The present invention has numerous advantages 
when compared to prior methods of constructing 
polynucleotides, which are susceptible to interference by 
contaminating truncates present in the oligonucleotides 
being ligated. Many prior oligonucleotide synthesis 
methods rely on relatively low temperatures or extended 
regions of complementarity to ensure stable duplex 
formation during ligation. For example, U.S. Pat. No. 
5,380,831, illustrated in FIG. 1C, performs ligation of 
4-5 base pair duplexes at 15°C, where such duplexes are 
relatively stable, rather than 37 to 42°C. U.S. Patent 
No. 6,110,668, illustrated in FIG. IB, discloses a method 
of ligating 80 to 130 nucleotide long oligonucleotides at 
52°C. All 80 to 130 bases of these oligonucleotides are 
base paired during ligation so that even at 52 °C the 
duplex is stable. 

[0065] Prior ligation methods employed stable duplex 

conditions because it was expected that ligation would be 
more efficient if the complexes that act as substrates 
for ligation were long-lived. Short-lived complexes 
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would have been expected to ligate only inefficiently, so 
the unstable ligation conditions used in the present 
invention might have been expected to give little or no 
ligation product. Surprisingly, not only does unstable 
sampling ligation according to the present invention 
result in successful ligation, it has the unexpected 
advantage of preventing truncates from interfering with 
ligation of the desired full-length first and second 
oligos . 

[0066] The invention also does not require the 

availability, or the synthesis, of a full-length 
template, as do many prior methods. A template is a 
single-stranded polynucleotide that is complementary to 
the desired product strand along substantially the entire 
length of that product strand, and is therefore 
approximately the same length as the desired product. 
FIGS. 1A - ID illustrate the role of a full-length 
template in several such prior methods of polynucleotide 
construction . 

[0067] The scaffold oligonucleotide of the present 

invention is of subtemplate length: it does not extend 
along substantially the entire length of the desired 
polynucleotide product. The scaffold oligonucleotide is 
complementary to the full length of the central 
oligonucleotide but generally extends only modestly 
beyond the ends of the central oligonucleotide. Such 
single-stranded sticky ends will typically be short 
enough that duplexes with the first and second oligos can 
be made unstable under convenient laboratory conditions. 
Embodiments of the invention use single- stranded regions 
at the ends of the scaffold oligonucleotide of 5, 6, 7, 
8, 9, or 10 nucleotides. (Note that in some embodiments 
of the present invention, as discussed supra, additional 



nucleotides extend from one or both ends of the scaffold 
oligonucleotide that are not complementary to the desired 
full-length ligation product. Such scaffold 
oligonucleotides are nonetheless also referred to as 
"subtemplate" in length since the region relevant to the 
kinetic sampling ligation reaction, i.e. that region 
complementary to other oligos, does not extend 
substantially the entire length of the desired 
polynucleotide product. Such additional sequences at the 
ends of the scaffold oligonucleotide are irrelevant to 
the use of the oligonucleotide as a scaffold, provided 
that such sequences do not interfere with annealing of 
the oligonucleotides involved in the kinetic sampling 
ligation reaction.) 

[0068] In one series of embodiments of the methods of 

the present invention, entire libraries of 
oligonucleotides are synthesized simultaneously using a 
multiplex ligation. A common core duplex is used to 
create all members of the library of probes; accordingly, 
all sequence variations among members of the library are 
localized to the portions of the first and/or second 
oligos extending beyond the scaffold oligonucleotide. 
The first and second oligos, and thus the full-length 
product polynucleotide, may extend beyond the scaffold 
oligonucleotide at either end, or at both ends, by 10, 
15, 20, 25, 30, 50, 75 or more nucleotides. 
[0069] In some embodiments, any number of different 

first oligo species can be ligated to the central 
oligonucleotide in the presence of a single predetermined 
second oligo species, or any number of different second 
oligo species can be ligated to the central 
oligonucleotide in the presence of a single predetermined 
first oligo species. The resulting reaction products 



will comprise a product polynucleotide with a defined 
sequence at one end and with a number of sequence 
variants at the other end. 

[00701 For example, it is possible to perform 

multiplex ligation to create probes specific for several 
different genetic loci in the same ligation reaction. A 
first oligo species and a second oligo species are 
synthesized for each genetic locus to be analyzed. The 
first and/or second oligonucleotide can optionally 
include further sequences, such as one or more genotypic 
labels, commonly referred to as bar code tags. A bar 
code tag is a short sequence designed algorithmically to 
maximize discrimination on a microarray having 
complements of the respective tags; a 1:1 correspondence 
as between tag sequence and first and/or second 
oligonucleotide permits each oligonucleotide so labeled 
to be detected by detection of the bar code uniquely 
associated therewith. See, e.g., Shoemaker et al . , 
Nature Genet. 14(4):450-6 (1996); EP 0799897; Fan et al . , 
Genome Res. 10:853-60 (2000); and U.S. Patent No. 
6,150,516, the disclosures of which are incorporated 
herein by reference in their entireties. The first 
and/or second oligonucleotide can additionally and 
optionally include one or more amplification primer 
binding sites . 

[0071] All first and second oligos also comprise 
terminal regions complementary to regions at the 3' and 
5' ends, respectively, of the scaffold oligonucleotide. 
All first and second oligos are ligated to the common 
core duplex in the same reaction mixture. The resulting 
mixture of products will contain full-length genetic 
probes for each genetic locus for which first and second 
oligos are supplied. This embodiment is particularly 



well suited for simultaneous construction of probes for 
only a relatively modest number of genetic loci. 
[0072] In yet another embodiment, different scaffold 

oligonucleotides, rather than a common core duplex, are 
used. A common central oligonucleotide is annealed to a 
mixture of several different scaffold oligonucleotides, 
each of which is perfectly complementary to the entire 
length of the central oligonucleotide, but each of which 
differs from every other scaffold oligonucleotide at the 
single-stranded regions extending beyond the central 
oligonucleotide. A different scaffold oligonucleotide is 
included for each polynucleotide probe desired to be 
synthesized. The result is a mixture of different core 
duplexes . 

[0073] Several different pairs of first and second 

oligos can then be added, such that there exists in the 
final ligation mixture exactly one first oligo species 
and exactly one second oligo species for each species of 
scaffold oligonucleotide. After ligation there will be 
one full-length product polynucleotide for each scaffold 
oligonucleotide added. In embodiments in which the 
polynucleotide is designed to probe genetic loci, there 
will be one full-length probe species for each genetic 
locus of interest. This embodiment is typically 
preferred for simultaneous construction of probes for a 
large number of genetic loci. 

[0074] Alternatively, several variants of both the 

first and the second oligos for a given target genetic 
locus can be used. 

[0075] Since first and second oligos are used without 

purification in a number of embodiments of the present 
invention, it is actually a mixture of the products of an 
oligonucleotide synthesis reaction that is used, rather 



than a single molecular species. The terms first oligo 
and second oligo as used herein include such mixtures of 
products. A mixture of multiple species of first or 
second oligo comprises mixtures of synthesis reaction 
products for each species synthesized. The number of 
such mixtures of products of oligonucleotide syntheses 
that can be used in multiplex ligation reactions in 
various embodiments of the present invention ranges from 
one to 2, 5, 10, 25, 50, 75, 100 or more. 

[0076] In yet another embodiment, a plurality of first 

oligo species and a single second oligo species, or a 
plurality of second oligo species and a single first 
oligo species, are included for each of the multiple 
scaffold oligonucleotides. In embodiments in which the 
first and/or second oligonucleotide includes a region 
complementary to a genetic locus, the resulting mixture 
of products will comprise a library of probes for each 
locus of interest, such libraries having a single defined 
sequence at one end and a variety of sequences at the 
other end. 

[0077] All of the various species of full-length 

product polynucleotides of a multiplex ligation can be 
purified from shorter contaminants and unreacted 
oligonucleotides by denaturing PAGE or agarose gel 
electrophoresis, provided that the lengths of all first 
and second oligos are selected to give approximately the 
same length full-length product. Denaturing PAGE or 
denaturing agarose gel electrophoresis will effectively 
resolve all of the full-length products from shorter 
contaminants and unreacted oligonucleotides, despite the 
sequence differences among the various products. 
Purification of the desired full-length probe, however, 
is not essential to the invention. 



[0078] Whether a duplex formed by any particular pair 
of oligonucleotides is stable or unstable under any given 
set of ligation reaction conditions can be assessed by a 
ligation challenge experiment. The ligation challenge 
may be understood with reference to FIG. 3. Results of 
such a ligation challenge are presented in Example 3. 
[0079] To perform a ligation challenge reaction, a 

core duplex is first formed by annealing a central 
oligonucleotide with a scaffold oligonucleotide. Two 
ligation reactions are then performed, one with addition 
of just the second oligo and the other with addition of 
the second oligo and a ten-fold molar excess of 
nonligatable competitor oligo. FIG. 3 illustrates 
addition of a truncate form of the second oligo, missing 
one nucleotide from the 5' end, but an oligonucleotide 
lacking a 5' phosphate group (i.e., a 5' -OH form) may 
also be used. FIG. 3 only shows five unligatable 
competitors for one full-length second oligonucleotide 
but is intended to represent a ten-fold excess. Ligation 
is performed under the specific ligation conditions to be 
tested, e.g. 30 minutes at 37 - C - 42°C using T4 DNA 
ligase. The resulting ligation products are then 
analyzed by denaturing PAGE. If the presence of a ten- 
fold molar excess of truncates (or 5' OH 
oligonucleotides) decreases the amount of ligation 
product (comprising the central and second oligos) that 
is formed more than a factor of three, the ligation 
conditions support stable duplex formation between the 
second oligo and the scaffold. If the amount of ligation 
product formed is decreased by a factor of three or less 
in the presence of a ten- fold molar excess of truncates 
(or 5' OH oligonucleotides) , the ligation conditions 
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support unstable duplex formation between the second 
oligo and the scaffold. 

[0080] An analogous ligation challenge (not shown) can 
be performed for the first oligonucleotide by adding a 
ten-fold excess of truncates missing one nucleotide at 
the 3' end of the first oligonucleotide. 
[0081] Since the predominant contaminants in 

unpurified oligonucleotides are truncates, a showing that 
unstable duplexes are formed, as assessed by the ligation 
challenge test described here, suggests that such 
contaminants in unpurified oligonucleotide preparations 
will not significantly inhibit formation of the desired 
ligation product under the ligation conditions tested. 
Under such conditions it is therefore unnecessary to 
purify the oligonucleotides to be ligated, with 
consequent savings in time, effort and expense. 
[0082] The advantages of sampling ligation, i.e. 

ligation under conditions where first and second oligos 
are unstably duplexed to the scaffold, are demonstrated 
in Example 3. The example compares the ligation 
efficiency for first and second oligos with 7 or 18 bases 
at their ends complementary to the scaffold. Under the 
conditions of the reaction (37°C and 42 °C) a 7 basepair 
duplex would be expected to be far less stable than an 18 
basepair duplex. As shown in Table 1, infra, ligation of 
oligonucleotides with 18 complementary bases is inhibited 
by contaminating truncates under conditions where 
ligation of oligonucleotides with only 7 complementary 
bases is not significantly inhibited. This insensitivity 
to the presence of competing contaminants is a major 
advantage of ligation under unstable duplex conditions. 
[0083] In another aspect, the present invention 

relates to a kinetic sampling ligation method to extend a 
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single-stranded polynucleotide by ligating an 
oligonucleotide to the 3' end, the 5' end, or both. 
[0084] In another aspect, the invention relates to 

kits to perform such extension reactions. 
[0085] The portions of the single-stranded 

polynucleotide adjacent to and comprising the 3' and 5' 
ends are referred to as the 3 ' and 5' terminal regions of 
the single- stranded polynucleotide, respectively. These 
"extending oligonucleotides" may comprise binding sites 
for PCR primers, bar codes, type IIS restriction 
endonucleases, or any other nucleic acid sequence having 
a desired property. ("PCR," as used herein, refers to 
any method of duplex nucleic acid amplification 
regardless of the enzyme used.) As discussed in more 
detail below, extending oligonucleotides may also be 
labeled. 

[0086] In some embodiments, the extending 

oligonucleotide is the product of a single 
oligonucleotide synthesis reaction. In other 
embodiments, the extending oligonucleotide is a mixture 
of the products of a plurality of oligonucleotide 
syntheses, the full-length products of which differ in 
sequence from each other. The products of five to 100 
separate oligonucleotide syntheses may be pooled to form 
the extending oligonucleotide mixture. 

[0087] Type IIS restriction enzymes have distinct DNA 

binding and cleavage domains; they recognize a specific 
sequence but cleave a defined distance away. .Reviewed in 
Szybalski et al . , Gene 100:13-26 (1991), the disclosure 
of which is incorporated herein by reference in its 
entirety. Type IIS restriction enzymes useful for 
purposes of the present invention include, but are not 
limited to, Aarl , Acc36I, AclWI , Acul , Alw26I , Alwl , 



AlwXl, AsuHPI, BJbsI, Bbvl6I, Bbvl , BJbvII , Bed, BceAI , 
Beef I, Bcgl, BciVI, Bco35I , BcoKI , Bfil, Bful, BfuAI, 
BinI, Bmrl, Bpil , Bpml , BpuAI , BpuEI , Bsal , BscAI , 
Bse3DI, BseGI, BseMI , BseMI I, BseRI , BseXI , Bsgrl , BsmAI , 
BsmBI, BsirtFI, Bso31I, BsoMAI , Bsp6II, Bsp22I, Bsp28I, 
Bsp432I, BspCNI, BspMI , BspPI , BspVI , BsrDl , BsrEI , 
Bstei, Bstlll, BstFBI, BstVlI, BstV2I, Bthll, BtsI, 
Bsu6I, Crel, Eamll04I # Earl, Ecil , Eco31I, Sco57I, 
EC057MI, Ecoll2I, Ecol25I, Esp3I, Faul , Fokl , Fsfl , Gsul , 
Hgal, HinGUIl, HphI , Hpyll , Ksp632I , Ll al , Lwel , MboII , 
Mlyl, Mmel, Mnll , Ncul , NgoBI , NgroVIII, Plel, PpsI , 
Ral81, RleAX, Sapl , Seel, SchI , SfaNI, Smul , Sthl32I, 
St si, Taqll, Tcel , TspGVil , Tthllll, t7Jball09I # and 
VpaK32I . 

[0088] In one embodiment, illustrated at FIG. 4A, 

extending oligonucleotides comprising binding sites for 
first and second PCR primers are ligated to the 3' and 
the 5' ends of a single stranded target polynucleotide, 
respectively. For simplicity, only one species of ss 
polynucleotide to be extended is shown in FIG. 4A, but 
the method may be used with mixtures of many different 
species of ss polynucleotides. In the embodiment 
illustrated in FIG. 4A, the target ss polynucleotide 
("target") is combined (e.g. mixed) with a pre-formed 
first extending duplex (comprising a first extending 
oligo annealed to a first extending scaffold oligo) and a 
pre-formed second extending duplex (comprising a second 
extending oligo annealed to a second extending scaffold 
oligo) . In other embodiments, however, the individual 
oligonucleotides and polynucleotide may be added in any 
order, with or without pre-f ormation of any duplex 
species . 



[0089] The extending scaffold oligos illustrated in 
FIG. 4 A are complementary to the extending oligos along a 
portion of their (the extending scaffolds') length, and 
complementary to the target polynucleotide along another 
portion of their (the extending scaffolds 7 ) length, such 
that when the target and the extending oligos are base 
paired to their respective extending scaffolds there are 
no "gaps" in the base pairing at the junctions of the 
extending oligos and the target. Duplex formation with 
the extending scaffolds brings the target polynucleotide 
and the extending oligonucleotides into proper 
orientation and proximity for efficient ligation. Such 
oligonucleotides are properly aligned for ligation. The 
extending oligos are then ligated to the target 
polynucleotide, for example by addition of T4 DNA ligase 
and appropriate co-factors, and incubation for 30 minutes 
at 37-42°C. 

[0090] The scaffold oligonucleotide in FIG. 4A does 
not have bases beyond those complementary to either the 
extending oligos or the target. Such a scaffold 
oligonucleotide could be chemically synthesized. In 
other embodiments, however, additional bases that are not 
complementary to either the extending oligo or the target 
extend from the first and/or the second end of the 
scaffold oligonucleotide. In one embodiment, the 
scaffold oligonucleotide comprises a single -stranded 
fragment of genomic DNA. 

[0091] After ligation, the single-stranded 
polynucleotide product may be isolated and used directly, 
or the complement may be synthesized to give a 

(partially) double -stranded product. 

[00 92] In one embodiment, in which the extending 

oligos comprise binding sites for type IIS restriction 



endonucleases, the double -stranded ligation product in 
FIG. 4A may be digested with type IIS restriction 
endonucleases to give a double -stranded DNA fragment 
suitable for cloning. The use of a different extending 
sequence at the 5' end from that used at the 3' end makes 
it possible to introduce different restriction sites at 
the two ends of the molecule, facilitating directional 
cloning . 

[0093] In embodiments in which extending oligos 

comprise PCR binding sites, the double -stranded product 
illustrated in FIG. 4A may be amplified by PCR, or the 
isolated single-stranded product may amplified in a PCR 
reaction directly without the separate step of creating 
the duplex product . 

[0094] The use of a different extending sequence at 
the 5' end from that used at the 3 ' end in such 
embodiments makes it possible to have different PCR 
primer binding sites at the 5' and 3' ends. This reduces 
the chances of spurious intra- strand "hairpin" formation 
of the type that can result when the same primer binding 
sequence is present at both ends of DNA fragment to be 
amplified, in which the 3' end of each strand is 
necessarily the reverse complement of the 5' end. 
Single-stranded fragments can bend to allow the formation 
of a hairpin (antiparallel duplex between the 3' and 5' 
ends) that can interfere with subsequent manipulations, 
such as PCR amplification and restriction digestion. In 
addition, since hairpin formation depends on the length 
and sequence composition of the fragment, not all 
sequence fragments will be equally affected. This 
inequality can lead to skewing of results when a large 
mixture of fragments, differing significantly in the 
likelihood of hairpin formation, is used. Skewing is 



particularly harmful in procedures such as random cloning 
of genomic sequences (discussed with reference to 
FIG. 4B, infra) where it is desirable for the clone 
library to represent all the fragments of the genomic DNA 
being cloned equally. Fragments with different PCR 
primer binding sites at the ends are less subject to such 
hairpin-related artifacts. 

[0095] In embodiments in which extending oligos 

comprise bar code sequences, the product polynucleotide 
comprising any given target single-stranded 
polynucleotide sequence may be detected by hybridization 
to an array containing an oligonucleotide complementary 
to the unique bar code sequence ligated to the target 
polynucleotide of interest. 

[0096] FIG. 4B illustrates one practical application 

of the method illustrated in FIG. 4A, i.e., random 
cloning of genomic DNA. In the embodiment illustrated, a 
double -stranded DNA sample of interest is digested with 
DNase I for a time sufficient to give fragments of the 
desired size. The resulting randomly nicked double 
stranded DNA is then denatured to produce a mixture of 
single-stranded polynucleotides suitable for extension by 
the method illustrated in FIG. 4A. 

[0097] A different kind of scaffold oligonucleotide is 
used for random cloning than would be used for extension 
of a single, known target sequence. When a single known 
sequence is to be extended according to the method of 
FIG. 4A, the short sticky ends of the scaffold 
oligonucleotides are synthesized to be complementary to 
the known sequence of the target polynucleotide. When 
randomly cloning genomic DNA, however, the sequence at 
the ends of the DNase fragments to be extended is 
variable and generally not known. The scaffold 
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oligonucleotide is therefore synthesized with a mixture 
of all four bases at each position in the sticky end, to 
ensure that there is a complementary scaffold 
oligonucleotide for all possible sequences at the termini 
of the DNase fragment to be cloned. In one embodiment, 
the sticky end is seven nucleotides long, and thus each 
scaffold oligonucleotide mixture contains over 16,000 
different sticky end sequences. In other embodiments, 
the randomized sticky end may be four, five, six, eight, 
ten, twelve or more nucleotides long. 

[0098] The method of FIG. 4A is then followed using 

the mixture of scaffold oligonucleotides with randomized 
sticky ends, and using the mixture of genomic DNA 
fragments as the target polynucleotide. Extending 
sequences, optionally comprising type IIS restriction 
endonuclease binding sites, are ligated to the genomic 
DNA fragments. Complementary strands are synthesized, 
and the double-stranded product is digested with the 
appropriate restriction endonucleases . Such restriction 
fragments may be cloned into a vector in a defined 
orientation in embodiments in which the enzymes cleaving 
the two ends are different, providing termini after 
digestion that are different. 

[0099] In another aspect of the present invention, a 

kinetic sampling ligation method is used to label 
polynucleotides at either or both of the 5' or 3' ends. 

[0100] In another aspect, the invention relates to 

kits to perform such labeling reactions. 

[0101] When creating polynucleotides for use as 

genetic probes, it is often desirable to label the probe. 
However, existing methods used to label the 5' end differ 
significantly from the methods used to label the 3' end, 
owing to the inherent structural differences between the 



two ends. For example, a radioactive or fluorescent 
label can be enzymatically added at the 3' end of a 
polynucleotide using polymerase or terminal transferase 
reactions in the presence of labeled nucleotides. 
Labeling of the 5' end of a polynucleotide, in contrast, 
is frequently limited to radioactive labeling using a 
kinase reaction. Alternative chemical methods for 
labeling either the 5' or the 3' end of a polynucleotide 
exist, but such methods are often expensive and 
potentially damaging to the polynucleotide itself. 
[0102] Methods for labeling either or both of the 5' 
or 3' ends of a polynucleotide in accord with the present 
invention are illustrated in FIGS. 5A and 5B and 
demonstrated in Examples 4 and 5. FIGS. 5A and 5B 
illustrate several methods of the present invention for 
labeling the 3' end of a polynucleotide by sampling 
ligation. One of skill in the art would also be able to 
apply the same principle to adapt the method to label the 
5' end of a polynucleotide. Multiplex labeling methods 
are not illustrated, but are within the scope of the 
present invention and involve labeling a plurality of 
different oligonucleotide species simultaneously in the 
same labeling reaction. 

[0103] For the embodiments illustrated in FIGS. 5A and 

5B, the labeling reaction is comprised of 1) an 
oligonucleotide or polynucleotide to be labeled at its 3' 
end, 2) a labeling oligonucleotide which is already 
labeled with the desired label, preferably at its 3' end, 
and 3) a labeling scaffold oligonucleotide. The labeling 
scaffold oligonucleotide plays the same role as the 
scaffold oligonucleotide in the method illustrated in 
FIG. 2A and described above, in that it provides 
complementary bases to properly align the oligonucleotide 



to be labeled and the labeling oligonucleotide for 
ligation. 

[0104] In the method illustrated in FIG. 5A, the 

labeling scaffold has at its 3' end a region 
complementary to the oligonucleotide to be labeled long 
enough to form a stable duplex, and a region at its 5' 
end complementary to the labeling oligonucleotide that is 
not long enough to form a stable duplex under ligation 
conditions. In the method illustrated in FIG. 5B, the 
labeling scaffold has at its 5' end a region 
complementary to the labeling oligonucleotide long enough 
to form a stable duplex, and a region at its 3' end 
complementary to the oligonucleotide to be labeled that 
is not long enough to form a stable duplex under ligation 
conditions. In one embodiment, the labeling scaffold has 
a 7 nt region complementary to the labeling 
oligonucleotide and a 20 nt region complementary to the 
oligonucleotide to be labeled. In another embodiment, 
the labeling scaffold has a 7 nt region complementary to 
the oligonucleotide to be labeled and a 20 nt region 
complementary to the labeling oligonucleotide. 
[0105] The labeling oligonucleotide can be made 
detectable by labeling it in any way that facilitates 
detection, such as by attachment of a detectable label. 
[0106] Detectable labels include, e.g., a 

radionuclide, a fluorophore, a fluorescence resonance 
energy transfer ("FRET") tandem fluorophore, a FRET donor 
and/or acceptor, or a mass tag. Indirectly detectable 
labels include, e.gr., an enzyme, a genotypic label, or a 
hapten . 

[0107] The label may, for example, be a radionuclide, 

such as 33 P, 32 P, 35 S, and 3 H. 
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[0108] The label may instead be a fluorophore. 
Commercially available fluorescent nucleotide analogues 
readily incorporated into the labeling oligonucleotides 
include, for example, Cy3-dCTP, Cy3-dUTP, Cy5-dCTP, Cy3- 
dUTP (Amersham Biosciences, Piscataway, New Jersey, USA) , 
fluorescein- 12 -dUTP , tetramethylrhodamine - 6 -dUTP , Texas 
Red®-5-dUTP, Cascade Blue®-7-dUTP, BODIPY® FL-14-dUTP, 
BODIPY® ™R-14-dUTP, BODIPY® TR-14-dUTP, Rhodamine 
Green™-5-dUTP, Oregon Green® 4 88-5-dUTP, Texas 
Red®-12-dUTP, BODIPY® 63 0/650 - 14 -dUTP , BODIPY® 
650/665-14-dUTP, Alexa Fluor® 488-5-dUTP, Alexa Fluor® 
532-5-dUTP, Alexa Fluor® 568-5-dUTP, Alexa Fluor® 
594-5-dUTP, Alexa Fluor® 546-14-dUTP, fluorescein- 12 -UTP , 
tetramethylrhodamine-6-UTP, Texas Red®-5-UTP, Cascade 
Blue®-7-UTP, BODIPY® FL-14-UTP, BODIPY® TMR-14-UTP, 
BODIPY® TR- 14 -UTP , Rhodamine Green™- 5 -UTP, Alexa Fluor® 
488-5-UTP, Alexa Fluor® 546-14-UTP (Molecular Probes, 
Inc. Eugene, OR, USA) . Protocols are available for 
custom synthesis of nucleotides having other 
fluorophores. Henegariu et al . , "Custom Fluorescent- 
Nucleotide Synthesis as an Alternative Method for Nucleic 
Acid Labeling," Nature Biotechnol . 18:345 - 348 (2000), 
the disclosure of which is incorporated herein by 
reference in its entirety. 

[0109] Other fluorophores available for post -synthetic 
attachment include, inter alia, Alexa Fluor® 350, Alexa 
Fluor® 532, Alexa Fluor® 568, Alexa Fluor® 594, Alexa 
Fluor® 647, BODIPY 493/503, BODIPY FL, BODIPY R6G, BODIPY 
530/550, BODIPY TMR, BODIPY 558/568, BODIPY 558/568, 
BODIPY 564/570, BODIPY 576/589, BODIPY 581/591, BODIPY 
630/650, BODIPY 650/665, Cascade Blue, Cascade Yellow, 
Dansyl, lissamine rhodamine B, Marina Blue, Oregon Green 
488, Oregon Green 514, Pacific Blue, rhodamine 6G, 
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rhodamine green, rhodamine red, tetramethylrhodamine , 
Texas Red (available from Molecular Probes, Inc., Eugene, 
OR, USA), and Cy2 , Cy3.5, Cy5.5, and Cy7 (Amersham 
Biosciences, Piscataway, NJ USA, and others) . 
[0110] FRET tandem fluorophores may also be used, such 
as PerCP-Cy5.5, PE-Cy5, PE-Cy5.5, PE-Cy7, PE-Texas Red, 
and APC-Cy7. 

[0111] Labels that are detectable by mass 
spectrometry, "mass tags," may also be used. Mass tags 
can be designed to provide hundreds of mass spectrally 
distinguishable species, allowing highly multiplexed 
reactions, and can be designed to be cleavable, typically 
photochemical ly cleavable, from the labeled nucleic acid, 
simplifying analysis. See, e.g., Kokoris et al . , Mol 
Diagn. 5(4):329-40 (2000) and Pusch et al . , 
Pharmacogenomics 3(4):537-48 (2002), the disclosures of 
which are incorporated herein by reference in their 
entireties . 

[0112] The labeling oligonucleotide may instead or in 
addition include at least one indirectly detectable 
label . 

[0113] For example, the oligonucleotide may include an 
enzyme . 

[0114] Enzymes useful for colorimetric detection are 
well known, and include alkaline phosphatase, 0- 
galactosidase, glucose oxidase, horseradish peroxidase 
(HRP) , and urease. Typical substrates for production and 
deposition of visually detectable products include 
o-nitrophenyl-beta-D-galactopyranoside (ONPG) ; 
o-phenylenediamine dihydrochloride (OPD) ; p-nitrophenyl 
phosphate (PNPP) ; p-nitrophenyl -beta-D-galactopyranoside 
(PNPG) ; 3 1 , 3 1 -diaminobenzidine (DAB); 3-amino-9- 
ethylcarbazole (AEC) ; 4-chloro-l-naphthol (CN) ; 
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5 -bromo-4-chloro-3-indolyl -phosphate (BCIP) ; ABTS® ; 
BluoGal; iodonitrotetrazolium (INT); nitroblue 
tetrazolium chloride (NBT) ; phenazine methosulf ate (PMS) ; 
phenolphthalein monophosphate (PMP) ; tetramethyl 
benzidine (TMB) ; tetranitroblue tetrazolium (TNBT) ; 
X-Gal; X-Gluc; and X-Glucoside. 

[0115] Enzymes may also be used for luminescent 

detection . 

[0116] For example, in the presence of hydrogen 

peroxide (H 2 0 2 ) , horseradish peroxidase (HRP) can catalyze 
the oxidation of cyclic diacylhydrazides , such as 
luminol , with subsequent light emission. Strong 
enhancement of the light emission can be produced by 
enhancers, such as phenolic compounds. Thorpe et al . , 
Methods Enzymol. 13 3:331-53 (1986); Kricka et al . , J. 
Immunoassay 17(l):67-83 (1996); and Lundqvist et al . , J". 
Biolumin. Chemilumin. 10(6):353-9 (1995), the disclosures 
of which are incorporated herein by reference in their 
entireties. Kits for such chemiluminescent and enhanced 
chemiluminescent detection of nucleic acids are available 
commercially . 

[0117] Labels can be attached to the end of the 

labeling oligonucleotide or internally, so long as the 
labels do not interfere with annealing of the labeling 
oligonucleotide to the labeling scaffold or the ligation 
of the labeling oligonucleotide to the oligonucleotide to 
be labeled. In one embodiment, ligation reactions are 
performed under conditions that permit stable duplex 
formation between the oligonucleotide to be labeled and 
the labeling scaffold, but not between the labeling 
oligonucleotide and the labeling scaffold, as discussed 
above in relation to FIG. 5A. In one embodiment, 
ligation is performed using T4 DNA ligase at 37°C - 42°C 
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for 30 minutes. The amount of product formed by ligation 
of the labeling oligonucleotide to the oligonucleotide to 
be labeled is then determined, for example by PAGE. 
[0118] In one embodiment, labeling reactions according 

to the present invention are performed in multiplex, i.e. 
with a plurality of oligonucleotide species to be labeled 
in a single ligation mixture. The plurality of 
oligonucleotide species comprises two or more 
oligonucleotide species, such as 24, 96 or more. 
[0119] The efficiency of labeling of oligonucleotides 

may optionally be assessed by hybridization to an array 
of complementary oligonucleotides or by any other 
suitable technique . 

[0120] Labeled oligonucleotides may optionally be 
purified by preparative denaturing PAGE or denaturing 
agarose gel electrophoresis. Preparative electrophoresis 
may be performed on a single labeled oligonucleotide, a 
plurality of oligonucleotides labeled in a multiplex 
reaction, or a mixture of the products of two or more 
labeling reactions . 

[0121] The following examples are provided by way of 
illustration, and not limitation. 



EXAMPLE 1 

Construction of Polynucleotides Suitable for Use as 
Genetic Probes Using Unpurified Products of a Single 
Oligonucleotide Synthesis Reaction 

[0122] A f luorescently labeled, gel purified 37 nt 
central oligonucleotide is annealed to a gel purified 
51 nt scaffold oligonucleotide to form a duplex over the 
entire length of the central oligonucleotide, with 7 nt 
remaining single-stranded at each end of the scaffold. 



These single-stranded 7 nt regions are referred to as 
sticky ends. 

[0123] A series of 48 pairs of first and second 
oligonucleotides are synthesized and used without 
purification. The 48 pairs of oligonucleotides vary in 
sequence, length, and purity. The length of the first 
oligonucleotide varies from 2 5 nt to 32 nt, and length of 
second from 51 nt to 58 nt . Based on an estimated 98% 
efficiency of single base addition during oligonucleotide 
synthesis, the yield of full-length product should be at 
most 52% for 32-mers and 30% for 58-mers, with the 
remainder being various truncated forms. However, in 
actual syntheses the yield of full-length product 
oligonucleotides can be significantly lower, and varies 
dramatically. Unless otherwise indicated, in this and 
other examples, all oligonucleotides are synthesized so 
that a phosphate group is present at the 5' terminus. 
[0124] Each full-length first oligo has at its 3' end 

a region of 7 nt complementary to the 7 nt sticky end at 
the 3' end of the scaffold oligonucleotide, and each 
full-length second oligo has at its 5' end a region of 
7 nt complementary to the 7 nt sticky end at the 5' end 
of the scaffold oligonucleotide. 

[0125] Separate ligation reactions are performed to 

ligate each of the 48 pairs of first and second oligos to 
the central oligonucleotide of the core duplex. First 
and second oligos are added simultaneously (1 /xM each) to 
a solution of pre- formed core duplex between the central 
and scaffold oligonucleotides (0.1 /xM each) in ligase 
buffer consisting of 50 mM Tris acetate (pH 7.6), 10 mM 
Mg acetate, 5 mM DTT, 2 5 /xg/ml bovine serum albumin 
(BSA) , and 1 mM ATP. All concentrations listed in this 
and other examples are concentrations in the final 



reaction mixture after all components have been added. 
The ligation reaction mixture is heated to 42 # C, T4 DNA 
ligase (New England Biolabs, Inc., Beverly, Mass., USA) 
is added to 8000 units/ml, and the reaction is allowed to 
proceed for 30 minutes. As used herein, a unit of ligase 
is a cohesive end ligation unit as defined by New England 
Biolabs. Reaction products are then resolved by 
denaturing 7M urea polyacrylamide gel electrophoresis. 
Bands are detected by scanning the gel with a fluorescent 
scanner. 

[0126] Results are shown in FIG. 6. Bands represent 

either unligated central oligonucleotide or central 
oligonucleotide ligated to first, second or both oligos, 
as indicated. The leftmost five lanes are control lanes 
loaded (from left to right) with unligated central 
oligonucleotide, central oligonucleotide ligated to one 
species of second oligo, central oligonucleotide ligated 
to a different species of second oligo, central 
oligonucleotide ligated to one species of first oligo, 
and central oligonucleotide ligated to a different 
species of first oligo. The lane to the right of the 
controls is empty, and is followed by 48 lanes loaded 
with products of ligation reactions performed with core 
duplex and a series of pairs of first and second oligos, 
different for each lane. Lanes to the right of those 
loaded with the 16th and 32nd ligation reaction mixtures 
are left unloaded. 

[0127] Forty six of the 48 pairs of first and second 

oligos are successfully ligated into full-length 
polynucleotide products, with little variation in the 
proportion of full-length product formed despite 
differences in the sequence, length and purity among the 



first and second oligonucleotide synthesis reaction 
product mixtures used. 



EXAMPLE 2 

Purification of the Full -Length Ligation Product 



[0128] Full-length product polynucleotides constructed 

according to the method of the present invention may 
optionally be purified away from other components of the 
ligation reaction, including excess of first and second 
oligonucleotides and ligation byproducts. 
Polynucleotides may be purified by denaturing 
polyacrylamide gel electrophoresis. The presence of urea 
in the denaturing gel interferes with detection of DNA 
using standard DNA staining agent such as ethidium 
bromide or SYBR @ Green (Molecular Probes, Inc., Eugene, 
Oregon, USA) unless the urea is removed by soaking the 
gel in water or buffer. The method presented involves 
direct visualization of the products of the reaction, 
allowing bands to be cut from the gel without the need to 
soak the gel to remove urea. 

[0129] In one experiment, 10 |iM of first and second 

oligonucleotides are mixed with 2 (aM of core duplex 
complex in a 50 p,L reaction volume. The reaction 
proceeds as described in EXAMPLE 1. An equal volume of 
formamide is added at the end of the reaction. This 
mixture is preheated for 2 min at 95 °C and then cooled on 
ice, and loaded onto a 10% preparative denaturing 
polyacrylamide gel. After separation is complete the gel 
is placed between layers of plastic cling wrap, placed 
over a fluorescent TLC plate and illuminated with 2 54 nm 
UV light from a handheld UV lamp. Polynucleotides are 
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visible due to shadowing of UV light, i.e. decreased 
fluorescence of the TLC plate beneath areas of the gel 
where the polynucleotides are located. Bands with 
mobilities corresponding to the full-length 
polynucleotide are visible in all ligation reactions 
described in EXAMPLE 1. 

[0130] If the central oligonucleotide is labeled with 

fluorescein it is possible, by placing the gel on a UV 
light source and detecting fluorescence, to detect lesser 
amounts of full-length polynucleotide than is possible 
with the method described above. Other effective 
combinations of illumination and filters can optionally 
be used to visualize the fluorescein label. Whatever the 
means of detection, bands corresponding to full-length 
polynucleotide products are cut from gel and extracted. 

EXAMPLE 3 

Inhibition of Ligation by Nonligatable Competing 
Oligonucleotides Under Stable Ligation Conditions and 
Unstable Ligation Conditions 

[0131] A core duplex with 7 nt sticky ends is created 
as in Example 1. A different core duplex with two 18 nt 
sticky ends is created by annealing the same central 
oligonucleotide with a 73 nt scaffold oligonucleotide. 
The sticky ends of both scaffold oligonucleotides are 
fully complementary to the 3' end of the first oligo and 
the 5' end of the second oligo. The core duplexes are 
formed by adding a two- fold molar excess of scaffold to 
the f luorescently labeled central oligonucleotide and 
heating to 80*C for 2 min. , followed by 46°C for an 
additional 30 minutes. Core duplexes are used at 
approximately 0.1 mM in the final ligation reactions. 
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[0132] First oligo, second oligo, both oligos or 
neither oligo are added to the core duplex with 7 nt 
sticky ends and, in separate reactions, to the core 
duplex with 18 nt sticky ends, in each case to a 
concentration of 1 txM (a ten- fold molar excess over the 
core duplexes) . The result is eight separate ligation 
reactions comprising one of the two core duplexes with 
either the first, the second, both or neither oligo (s) . 
[0133] An additional ligation reaction is performed, 

with conditions identical to each of those described 
above except that nonligatable competing oligonucleotides 
are added at a ten- fold molar excess (10 fiM) over the 
first and second oligos. These nonligatable oligos 
simulate contaminants. For the first oligo, the 
nonligatable competing oligo is identical to the full- 
length first oligo except that it lacks one nucleotide at 
the 3' end. This competing oligo is referred to as the 
"first (n-1) oligo." For the second oligo, the 
nonligatable competing oligo is identical to the full- 
length second oligo except that it lacks a phosphate 
group at the 5' end. This competing oligo is referred to 
as the "second (5' -OH) oligo." Reactions involving only 
the first oligo are challenged only with the first (n-1) 
oligo, and reactions involving only the second oligo are 
challenged only with the second (5' -OH) oligo. Reactions 
involving both the first and second oligos are challenged 
with both nonligatable competing oligos. Control 
ligations containing only nonligatable competing oligos 
are also performed, i.e. ligations like those mentioned 
previously except that no first or second oligo is added. 
[0134] Four hundred units of T4 DNA ligase are added 

to each 50 reaction. ATP and BSA are added to 1 mM 
and 25 Mg/ml final concentrations, respectively. 



Duplicates of all of the aforementioned ligation 
reactions are performed, one at 37 8 C and the other at 
42 P C. Time points are withdrawn at 10 and 3 0 minutes and 
stopped by addition of three volumes of formamide 
containing bromphenol blue. Samples are then loaded onto 
a 10% PAGE gel containing 7M urea. Gel results are 
quantified using a Molecular Dynamics (Amersham 
Biosciences) fluorescent scanner. After the scan, the 
intensity of each band is determined by integrating the 
area under the corresponding peak. The average of the 10 
and 30 minute results is presented in TABLE 1. 



TABLE 1 

Percent Formation of Full -Length Product Polynucleotide 



Sticky End Competing First Competing Second 
(Temperature) Oligonucleotide Oligonucleotide^ 



Both Competing 
Oligonucleotides 



7 nt (37°C) 

7 nt (42°C) 

18 nt (37°C) 

18 nt (42°C) 



100% 
100% 
22% 
32% 



83% 
88% 
8% 
7% 



77% 
91% 
(<2 .5%) 
3% 



[0135] Table 1 presents the amount of full-length 

ligation product obtained in reactions containing a ten- 
fold molar excess of nonligatable competing 
oligonucleotides as a percentage of the amount of full- 
length ligation product obtained in identical reactions 
lacking the competing oligonucleotides. At both 37°C and 
42°C / the reactions involving unstable duplex formation 
between the first (and second) oligos and the scaffold, 
i.e. those involving 7 nt sticky ends, are significantly 
less inhibited by the presence of simulated contaminants. 
A ten-fold excess of the first (n-1) oligo has no effect 
on ligation of the first oligo to the central 
oligonucleotide, and a similar excess of the second (5'- 



OH) oligo decreases ligation of the second oligo to the 
central oligonucleotide only 12-17%. The presence of 
both simulated contaminants reduces full-length product 
formation 19-23% for reactions involving unstable 7 nt 
sticky ends. 

[0136] In contrast, ligations involving 18 nt sticky 

ends are severely impaired by simulated contaminants. A 
ten-fold excess of the first (n-1) oligo reduces ligation 
of the first oligo to the central oligonucleotide by 68- 
78%, more than three-fold. A similar excess of the 
second (5' -OH) oligo decreases ligation of the second 
oligo to the central oligonucleotide from 12 to 14 -fold. 
The presence of both simulated contaminants reduces full- 
length product formation by 30 -fold for reactions at 
42°C, and to below the limits of detection (>40-fold) for 
reactions at 37°C. 

[0137] An additional and unexpected benefit of the use 
of unstable ligation is that very little adenylation of 
the central oligonucleotide occurs, whereas significant 
adenylation occurs under stable ligation conditions. 
Adenylation is an undesired side reaction that involves 
transfer of an AMP moiety from a ligase-AMP intermediate 
to the 5' end of one of the oligonucleotides to be 
ligated. In a successful ligation reaction, the AMP 
moiety is released upon formation of the phosphodiester 
bond between the two oligonucleotides, i.e. the ligation 
of the strands. However, under some conditions the 
ligation reaction fails to proceed to the formation of a 
phosphodiester bond, leaving an adenylated 5' end. Such 
adenylated strands can be identified on the basis of 
their distinctive mobility in PAGE. 

[0138] FIG. 7 shows densitometric scans of lanes from 

a gel loaded with ligation reaction products, 
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illustrating the formation of adenylated central 
oligonucleotide (indicated by the arrow) . The numbers 
refer to lanes on the gel. The leftmost peak in lanes 2, 
3, 6-9, 12 and 13 is the full-length polynucleotide 
product, and the rightmost peak in all lanes is the 
central oligonucleotide. Lanes 1-7 show the products of 
ligation reactions performed using a core duplex with a 7 
nt sticky end, which forms unstable duplexes with the 
first and second oligos under reaction conditions (37°C) . 
No appreciable quantity of adenylated central 
oligonucleotide is formed. Lanes 8-13 show the products 
of ligation reactions performed using a core duplex with 
an 18 nt sticky end, which forms stable duplex with the 
first and second oligos under reaction conditions. Only 
reactions involving stable duplex formation and including 
the truncated form of the first oligo, shown in lanes 10- 
13, show significant formation of the adenylated central 
oligonucleotide. Up to 80% of the central 

oligonucleotide is adenylated in reactions shown in lanes 
10-13. Once adenylated, such oligonucleotides cannot 
subsequently be ligated at their 5' end. 

[013 9] Without intending to be bound by theory, it is 
possible that the first (n-1) oligo in the 18 nt sticky 
end reaction, which can form a 17 bp duplex with the 
scaffold oligonucleotide, forms a stable complex, albeit 
with a single base gap between the ends that would 
otherwise be ligated. This gapped duplex may permit T4 
DNA ligase binding and AMP transfer to the 5' end of the 
central oligonucleotide (adenylation) , but not 
phosphodiester bond formation. The analogous complex 
between the first (n-1) oligonucleotide in the 7 nt 
sticky end reaction would be able to form only a 6 bp 
duplex, which may be too unstable to permit binding and 



AMP transfer by T4 DNA ligase. Whatever the molecular 
mechanism, the method of the present invention has the 
advantage that the central oligonucleotide is not 
irreversibly inactivated by adenylation when unstable 
ligation conditions (7 nt sticky ends) are used, as it is 
when stable ligation conditions (18 nt sticky ends) are 
used. 

EXAMPLE 4 

Labeling of the 3' or 5' Ends of Unpurified 
Oligonucleotides by Kinetic Sampling Ligation 

[0140] Six different unpurified oligonucleotides are 
labeled at their 3' ends by kinetic sampling ligation and 
six others are labeled at their 5' ends. FIGS. 5A and 5B 
show two methods of labeling the 3' end of an 
oligonucleotide. No schematic is presented for the 
analogous method for labeling the 5' end of an 
oligonucleotide . 

[0141] To label the 3' ends of the first six 

oligonucleotides, six separate 27 nt labeling scaffold 
oligonucleotides and one labeling oligonucleotide are 
synthesized. Each of the scaffold oligonucleotides 
contains a different 20 nt sequence at its 3' end 
complementary to the 3' end of one of the six 
oligonucleotides to be labeled, and a 7 nt common 
sequence at its 5' end complementary to the 5' end of the 
labeling oligonucleotide. The labeling oligonucleotide 
for 3' end labeling is 20 nt long and has fluorescein 
attached at its 3' end. 

[0142] To label the 5' end of the second six 

oligonucleotides (no schematic shown) , six separate 27 nt 
labeling scaffold oligonucleotides and one labeling 



oligonucleotide are synthesized. Each of the scaffold 
oligonucleotides contains a different 20 nt sequence at 
its 5' end complementary to the 5' end of one of the six 
oligonucleotides to be labeled, and a 7 nt common 
sequence at its 3' end complementary to the 3' end of the 
labeling oligonucleotide. The labeling oligonucleotide 
for 5' end labeling is 10 nt long and has fluorescein 
attached at its 5' end. 

[0143] Phosphate groups are chemically coupled to the 

5' ends of all oligonucleotides used in this example 
during synthesis, with the exception of those 
oligonucleotides that are labeled at their 5 ' end with 
fluorescein. No oligonucleotides used in this example 
are purified with the exception of the labeling 
oligonucleotides . 

[0144] In separate reactions, each of the six 

oligonucleotides to be labeled at their 3' ends, ranging 
in length from 52 nt to 58 nt , and each of the six 
oligonucleotides to be labeled at their 5' ends, ranging 
in length from 25 nt to 32 nt, is annealed to the 
corresponding labeling scaffold oligonucleotide. 
Oligonucleotides to be labeled are used at 0.1 |iM and 
labeling scaffold oligonucleotides are used at 0.5 |aM. 
Ligase buffer is added, and the 3' end or 5' end labeling 
oligos are added to their respective reactions to 0 . 2 |aM. 
Reactions are heated to 42°C and then 400 units of T4 DNA 
ligase is added to each 50 mL reaction. After incubation 
for 30 min. at 42 °C, reaction products are analyzed by 
10% PAGE. 

[0145] Results of the experiment are presented in 

FIG. 8. Lanes 1-6 are the products of the six 3' end 
labeling reactions, and lanes 7-12 are the products of 



the six 5' end labeling reactions. The lower clusters of 
bands in each lane are unreacted labeling 
oligonucleotide, and the dark upper bands in each lane 
are the desired labeled ligation products. All 12 
oligonucleotides are labeled equally well regardless of 
which end is being labeled. Since only oligonucleotides 
with perfect ends can be ligated, labeling by sampling 
ligation as described produces only one specific ligation 
product for each labeling reaction. Gaps will remain 
between any contaminating truncated oligonucleotides and 
the labeling oligonucleotide when annealed to the 
labeling scaffold oligonucleotide, preventing ligation, 
and thus labeling, of the truncates. 

EXAMPLE 5 

Multiplex Labeling of the 3' or 5' Ends of Unpurified 
Oligonucleotides by Kinetic Sampling Ligation 

[0146] In the multiplex aspect of the labeling 

reaction of the present invention, oligonucleotides are 
pooled and labeling is performed on the mixture 
(containing a plurality of oligonucleotides) in a single 
reaction, rather than on each oligonucleotide in a 
separate reaction. 

[0147] Over five hundred different oligonucleotides 
are labeled at their 3' ends in a series of 24-plex 
kinetic sampling ligation reactions. The procedure for 
multiplex kinetic sampling ligation labeling the 5' end 
is not described but is performed by analogy with 
Example 4 (above) . Design, annealing and ligation of 
oligonucleotides is performed as in Example 4, and as 
illustrated in FIG. 5B, with the following modifications 
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[0148] The labeling oligonucleotide is 21 nt long, 

with a fluorescent dye label at its 3' end. The scaffold 
oligonucleotide is 28 nt long, and its 5 ' end is 
complementary to the entire length of the labeling 
oligonucleotide. When annealed with the labeling 
oligonucleotide there remains a 7 nt single stranded 
region at the 3' end of the scaffold oligonucleotide. 
[0149] The oligonucleotides to be labeled are 28 bases 
long, with the 3 ' -most 7 nt of all oligonucleotides being 
identical. This 7 nt region is complementary to the 3'- 
most 7 nt of the scaffold oligonucleotide (here also 
termed a "labeling scaffold'). The result is that all 
oligonucleotides to be labeled differ in the 21 nt at 
their 5' end, but all can anneal to the same scaffold 
oligonucleotide at their 3' ends. 

[0150] The labeling oligonucleotide is annealed to the 

scaffold oligonucleotide in a 100 ill reaction containing 
27 mM labeling oligonucleotide, 32 yM labeling scaffold 
oligonucleotide and 4 . IX T4 probe buffer. 10X T4 probe 
buffer comprises 500 mM Tris Acetate pH 7.6, 100 mM Mg 
Acetate and 50 mM DTT. The annealing reaction is heated 
to 80 °C for two minutes, then 3 7°C for 3 0 minutes and 
finally chilled on ice. The duplex resulting from the 
pairing of the labeling oligonucleotide and the scaffold 
oligonucleotide is referred to as the labeling core 
duplex . 

[0151] The reaction is diluted to 10 mM labeling core 

duplex in 1 . 5X T4 probe buffer. A reaction is then 
prepared containing 4 jjlM labeling core duplex, 2 /iM total 
oligonucleotide to be labeled and IX T4 probe buffer. 
For a 24-plex labeling reaction, each individual 
oligonucleotide to be labeled is present at approximately 
83 nM. The reaction mixture is heated to 42°C, T4 DNA 
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ligase is added to 4 units/Ml/ and the reaction is 
incubated at 42 °C for 30 minutes. The ligation reaction 
is then heated to 70°C for 10 minutes and cooled to 4°C. 
The ligation produces a set of 49 nt oligonucleotides 
with fluorescent labels at their 3' ends and a 21 nt 
variable region at their 5' ends. The products of all of 
the 24-plex kinetic sampling labeling reactions are 
pooled and purified by PAGE. 

[0152] The relative efficiency of labeling of the 

separate oligonucleotide sequences is determined by 
hybridizing the mixture to an array of oligonucleotides. 
The array comprises a series of oligonucleotides 
immobilized at separate locations on a solid support. 
The sequences of the oligonucleotides bound to the array 
are chosen to be complementary to those in the mixture. 
The intensity of fluorescence signal at each location 
reflects the efficiency of labeling of the 
oligonucleotide complementary to the oligonucleotide 
immobilized at that particular location. 

[0153] The multiplex-labeled oligonucleotide sample is 

allowed to hybridize to the oligonucleotide array 
overnight in 6X SSPE (60 mM sodium phosphate, pH 7.4, 
6 mM disodium EDTA, 900 mM sodium chloride) at 40°C. The 
arrays are then washed and scanned to quantify 
fluorescence signal intensity at each location. 
[0154] FIG. 9 shows the results of the array 

hybridization, with fluorescence intensity, in arbitrary 
units, plotted for each sequence. Each sequence is 
assigned an arbitrary numerical designation for ease of 
presentation of the data. Not all numbers are used in 
the numerical designations, i.e. there are gaps in 
sequence numbering. FIG. 9 shows that signal intensity 
is relatively uniform for all sequences, indicating 
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relatively uniform labeling efficiency for the sequences 
examined . 

[0155] As a further test of the efficacy of 24-plex 
kinetic sampling ligation labeling, one group of 
oligonucleotides is labeled using both the single-plex 
and 24-plex methods. The products of the single-plex 
reactions are pooled and hybridized to any array of 
complementary oligonucleotides, as described above, as 
are the products of the 24-plex labeling. 

[0156] The signal intensity distribution for both sets 

of labeled oligonucleotides is shown at FIG. 10. The 
normalized signal is plotted for each oligonucleotide: 
normalized values are calculated by dividing the 
fluorescence signal intensity (in arbitrary units) for 
each oligonucleotide by the mean value for all 
oligonucleotides labeled the same way (i.e. singleplex or 
multiplex) . The distribution of signal intensities is 
comparable between the individually labeled 
oligonucleotides and those labeled in 24-plex, 
demonstrating that oligonucleotides can be satisfactorily 
labeled in 24-plex reactions with consequent savings in 
effort and expense. It is likely that more than 24 
oligonucleotides may also be labeled in multiplex. 
[0157] Unless specifically stated, no step of the 
method of this invention requires any particular order of 
addition of materials, or order of performance of steps. 
All patents, patent publications, and other published 
references mentioned herein are hereby incorporated by 
reference in their entirety as if each had been 
individually and specifically incorporated by reference 
herein . 

[0158] As used herein, reference to the presence of a 
specific number of oligonucleotides or sequences (e.g. 



one, two, etc.) refers to the number of different species 
present, rather than the number of molecules present. 
[0159] Examples are intended to illustrate the 

invention and do not by their details limit the scope of 
the claims of the invention. While preferred 
illustrative embodiments of the present invention are 
described, it will be apparent to one skilled in the art 
that various changes and modifications may be made 
therein without departing from the invention, and it is 
intended in the appended claims to cover all such changes 
and modifications that fall within the true spirit and 
scope of the invention. 



